On Bioinformatic Resources

نویسنده

  • Runsheng Chen
چکیده

A starting point of curating bioinformatic resources for the public is marked by the establishment of the US National Center for Biotechnology Information (NCBI) in 1988 [1]. One of its many purposes is certainly to echo the initiative of the Human Genome Project (HGP) –– when two landmark reports were published at the same time: ‘‘Mapping and Sequencing the Human Genome’’ by the National Research Council [2] and ‘‘Mapping Our Genes –– The Genome Project: How Big, How Fast?’’ by the US Congress [3]. As HGP is prepared to scale up its sequencing operations about 5 years into the Project, a new discipline – bioinformatics – became inevitable. It emerged originally for processing and sharing genome sequences and was thus known as genome informatics. In the first five-year plan of HGP, it was pointed out that genome informatics is a scientific discipline that encompasses all aspects of genome information acquisition, processing, storage, distribution, analysis, and interpretation [4]. This insightful statement was footnoted by the launch of GenBank at NCBI in 1992, a database of nucleotide sequences, in collaboration with its international partners at the European Molecular Biology Laboratory (EMBL) and the DNA Data Bank of Japan (DDBJ). Now, some 20 years have passed, not only GenBank still prospers but other databases on human genomics and biology have emerged to fill up all data landscapes [5]. Genomics together with many other research fields of life sciences had entered the Era of Large-scale Data Acquisition in the early 90’s. The Era was led by the fast accumulation of human genomic sequences and followed by similar data from other large model organisms [6]. As soon as HGP declared the human genome sequenced in 2001 [7], the HapMap Project [8] was announced to sequence hundreds of human genomes in a diverse population background. Microbial genomics has also been pursued into both metagenomics and pangenomics [9,10]. The big data collection efforts have been extending all the way into its vertical path –– from DNA to RNA to Protein [11,12]. Another expansion of the efforts is horizontally toward biological functions and disease relevance [13–15]. In order to annotate biological functions at different levels of anatomy and physiology –– cell, organ, tissue, and systems –– for biomedical applications, data integration and interpretation become more relevant, and thus the birth of systems biology, which takes account of data from all omics fields and deciphers them in a context of biological networks. In recent years, with strong funding and rapid development, the sequence technology has been advancing significantly, resulting in dramatic cost reduction and explosion of data accumulation. Therefore, a large number of bioinformatic resources become obvious and there is demand for some comprehensive digestions. There are several data types being curated in bioinformatic resources world-wide. The first of them is classified as scale-up omics data, which includes genome-wide data from the vertical path as mentioned above: genome, transcriptome, proteome, and metabolome, just to name a few. Each of these data may have different disease relevance, such as DNA variations for cancer predisposition and protein structures for drug development. The second type of data resources includes data generated by molecular biologists that tend to look at molecular details, such as transcription factors and signal transduction pathways, where interactions among macromolecules and

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bioinformatic and empirical analysis of a gene encoding serine/threonine protein kinase regulated in response to chemical and biological fertilizers in two maize (Zea mays L.) cultivars

Molecular structure of a gene, ZmSTPK1, encoding a serine/threonine protein kinase in maize was analyzed by bioinformatic tool and its expression pattern was studied under chemical biological fertilizers. Bioinformatic analysis cleared that ZmSTPK1 is located on chromosome 10, from position 141015332 to 141017582. The full genomic sequence of the gene is 2251 bp in length and includes 2 exons. ...

متن کامل

Bioinformatic approaches to augment study of epithelial-to-mesenchymal transition in lung cancer.

Bioinformatic approaches are intended to provide systems level insight into the complex biological processes that underlie serious diseases such as cancer. In this review we describe current bioinformatic resources, and illustrate how they have been used to study a clinically important example: epithelial-to-mesenchymal transition (EMT) in lung cancer. Lung cancer is the leading cause of cancer...

متن کامل

Bioinformatic training needs at a health sciences campus

BACKGROUND Health sciences research is increasingly focusing on big data applications, such as genomic technologies and precision medicine, to address key issues in human health. These approaches rely on biological data repositories and bioinformatic analyses, both of which are growing rapidly in size and scope. Libraries play a key role in supporting researchers in navigating these and other i...

متن کامل

GPS@ Bioinformatic Portal: from Network to EGEE Grid

Bioinformatic analysis of data produced by complete genome sequencing projects is one of the major challenge of the current years. Integrating up-to-date databanks and relevant algorithms is a clear requirement of such analysis. Grid computing would be a viable solution to distribute data, algorithms, computing and storage resources for Genomics. Providing the bioinformaticians with a good inte...

متن کامل

Bioinformatic Resources of microRNA Sequences, Gene Targets, and Genetic Variation

Variation in quantitative gene expression has been observed in natural populations and associated with various complex traits/phenotypes such as risks for common diseases and drug response. MicroRNAs (miRNAs), a family of small, non-coding RNA molecules, have been demonstrated to be an important class of gene regulators that mostly downregulate gene expression at the post-transcriptional level....

متن کامل

Petra, osiris and molinspiration: A computational bioinformatic platform for experimental in vitro antibacterial activity of annulated uracil derivatives

Annulated pyrano[2,3-d]pyrimidine/pyrano[2,3-d]uracil derivatives were synthesized using aromatic aldehydes, active methylene compounds and barbituric acid in presence of dibutylamine (DBA) catalyst in ethanol as solvent. The different substituents on phenyl ring in the fused pyrano uracil skeleton showed productive influence on its antimicrobial activity against some gram positive and gram neg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 13  شماره 

صفحات  -

تاریخ انتشار 2015